<..no one will be able to track usage..> It will be harder, BUT---
VOIP has distinct signatures in backbone networks. First of all, there is the H.323 call setup and negotiation; that is TCP. Next is the actual VOIP data and that is RTP/UDP and these are 60 Byte ip datagrams, encapsulated in whatever the frame encapsulation is, ie. HDLC or Frame Relay or whatever. The complexity arises with VPNs and the methods used for that. (ip-in-ip, ie. tunnels, or "Tag", or whatever you want to call it--we have had this particular discussion before ;-) ).
So, you will be able to get metrics but it is a bit harder. An ip datagram is an ip datagram, maybe handled differently depending on some "service qualifier", but an ip datagram none-the-less. Traditional POTs is circuit switched. Modern POTs is VOIP and that is packet switched (or routed--tis the same).
In the long run, POTs voice will be VOIP and just another ip datagram, along with all the others. Traditional ip data today exceeds voice bits even of the circuit switched variety. This market is HUGE !!
All IMHO, of course--- |