Message ID | e253843e07ce5d15c2c8d11ce786bf979ed85ca5.1282550649.git.richard.cochran@omicron.at |
---|---|
State | Not Applicable, archived |
Delegated to: | David Miller |
Headers | show |
On Mon, Aug 23, 2010 at 04:16, Richard Cochran wrote: > --- a/arch/blackfin/include/asm/unistd.h > +++ b/arch/blackfin/include/asm/unistd.h > @@ -389,8 +389,9 @@ > #define __NR_rt_tgsigqueueinfo 368 > #define __NR_perf_event_open 369 > #define __NR_recvmmsg 370 > +#define __NR_clock_adjtime 371 > > -#define __NR_syscall 371 > +#define __NR_syscall 372 > #define NR_syscalls __NR_syscall > > /* Old optional stuff no one actually uses */ > --- a/arch/blackfin/mach-common/entry.S > +++ b/arch/blackfin/mach-common/entry.S > @@ -1628,6 +1628,7 @@ ENTRY(_sys_call_table) > .long _sys_rt_tgsigqueueinfo > .long _sys_perf_event_open > .long _sys_recvmmsg /* 370 */ > + .long _sys_clock_adjtime > > .rept NR_syscalls-(.-_sys_call_table)/4 > .long _sys_ni_syscall FYI, this is going to hit a conflict as i'm about to push out an update to wire up the new 2.6.36 syscalls -mike -- To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On Mon, Aug 23, 2010 at 10:16, Richard Cochran <richardcochran@gmail.com> wrote: > diff --git a/kernel/compat.c b/kernel/compat.c > index 5adab05..df1e469 100644 > --- a/kernel/compat.c > +++ b/kernel/compat.c > @@ -628,6 +628,26 @@ long compat_sys_clock_gettime(clockid_t which_clock, > return err; > } > > +long compat_sys_clock_adjtime(clockid_t which_clock, int ppb, > + struct compat_timespec __user *tp) > +{ > + long err; > + mm_segment_t oldfs; > + struct timespec ts, *ptr = NULL; Shouldn't ptr be initialized with tp? > + > + if (tp) { > + if (get_compat_timespec(&ts, tp)) > + return -EFAULT; > + ptr = &ts; > + } > + oldfs = get_fs(); > + set_fs(KERNEL_DS); > + err = sys_clock_adjtime(which_clock, ppb, > + (struct timespec __user *) ptr); > + set_fs(oldfs); > + return err; > +} > + > long compat_sys_clock_getres(clockid_t which_clock, > struct compat_timespec __user *tp) > { -- To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On Mon, Aug 23, 2010 at 04:22:58AM -0400, Mike Frysinger wrote: > FYI, this is going to hit a conflict as i'm about to push out an > update to wire up the new 2.6.36 syscalls Thanks for the "heads up." At this point, the patch is meant just to generate discussion and feedback. Thanks, Richard -- To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On Mon, Aug 23, 2010 at 10:25:35AM +0200, Bert Wesarg wrote: > On Mon, Aug 23, 2010 at 10:16, Richard Cochran <richardcochran@gmail.com> wrote: > > diff --git a/kernel/compat.c b/kernel/compat.c > > index 5adab05..df1e469 100644 > > --- a/kernel/compat.c > > +++ b/kernel/compat.c > > @@ -628,6 +628,26 @@ long compat_sys_clock_gettime(clockid_t which_clock, > > return err; > > } > > > > +long compat_sys_clock_adjtime(clockid_t which_clock, int ppb, > > + struct compat_timespec __user *tp) > > +{ > > + long err; > > + mm_segment_t oldfs; > > + struct timespec ts, *ptr = NULL; > > Shouldn't ptr be initialized with tp? It could be, but the logic turns out the same either way. The semantics of the call is, if 'tp' is NULL, then adjust the frequency by 'ppb', otherwise adjust clock time by 'tp'. > > > + > > + if (tp) { > > + if (get_compat_timespec(&ts, tp)) > > + return -EFAULT; > > + ptr = &ts; > > + } > > + oldfs = get_fs(); > > + set_fs(KERNEL_DS); > > + err = sys_clock_adjtime(which_clock, ppb, > > + (struct timespec __user *) ptr); > > + set_fs(oldfs); > > + return err; > > +} > > + > > long compat_sys_clock_getres(clockid_t which_clock, > > struct compat_timespec __user *tp) > > { -- To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On Monday 23 August 2010, Richard Cochran wrote: > A new syscall is introduced that allows tuning of a POSIX clock. The > syscall is implemented for four architectures: arm, blackfin, powerpc, > and x86. > > The new syscall, clock_adjtime, takes two parameters, a frequency > adjustment in parts per billion, and a pointer to a struct timespec > containing the clock offset. If the pointer is NULL, a frequency > adjustment is performed. Otherwise, the clock offset is immediately > corrected by skipping to the new time value. It looks well-implemented, and seems to be a reasonable extension to the clock API. I'm looking forward to your ptp patches on top of this to see how it all fits together. For new syscalls, it's best to take linux-api on Cc. I also added John, since he participated in the discussion. > In addtion, the patch provides way to unregister a posix clock. This > function is need to support posix clocks implemented as modules. This part should probably be a separate patch, and you need to add some form of serialization here to avoid races between the clock system calls and the unregistration. > diff --git a/kernel/posix-cpu-timers.c b/kernel/posix-cpu-timers.c > index 9829646..5843f5a 100644 > --- a/kernel/posix-cpu-timers.c > +++ b/kernel/posix-cpu-timers.c > @@ -207,6 +207,11 @@ int posix_cpu_clock_set(const clockid_t which_clock, const struct timespec *tp) > return error; > } > > +int posix_cpu_clock_adj(const clockid_t which_clock, int ppb, > + struct timespec *tp) > +{ > + return -EOPNOTSUPP; > +} EOPNOTSUPP is specific to sockets, better use -EINVAL here. Where do you use this function? > /* > * Sample a per-thread clock for the given task. > diff --git a/kernel/posix-timers.c b/kernel/posix-timers.c > index ad72342..089b0d1 100644 > --- a/kernel/posix-timers.c > +++ b/kernel/posix-timers.c > @@ -197,6 +197,12 @@ static int common_timer_create(struct k_itimer *new_timer) > return 0; > } > > +static inline int common_clock_adj(const clockid_t which_clock, int ppb, > + struct timespec *tp) > +{ > + return -EOPNOTSUPP; > +} > + > static int no_timer_create(struct k_itimer *new_timer) > { > return -EOPNOTSUPP; So we already return -EOPNOTSUPP in some cases? The man page does not document this. I wonder if we should change that to -EINVAL as well. > @@ -488,6 +494,21 @@ void register_posix_clock(const clockid_t clock_id, struct k_clock *new_clock) > } > EXPORT_SYMBOL_GPL(register_posix_clock); > > +void unregister_posix_clock(const clockid_t clock_id) > +{ > + struct k_clock *clock; > + > + if ((unsigned) clock_id >= MAX_CLOCKS) { > + pr_err("POSIX clock unregister failed for clock_id %d\n", > + clock_id); > + return; > + } > + > + clock = &posix_clocks[clock_id]; > + memset(clock, 0, sizeof(*clock)); > +} > +EXPORT_SYMBOL_GPL(unregister_posix_clock); > + It would be possible to add locks here to serialize unregistration of a clock against dereferencing members of posix_clocks[], but that would cause noticable overhead. A better alternative might be to make it an RCU-protected array of pointers, and use a rcu_assign_pointer/rcu_syncronize/kfree or call_rcu sequence in unregister_posix_clock. Or you just live with not being able to unload this module. Arnd -- To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On Mon, Aug 23, 2010 at 02:57:26PM +0200, Arnd Bergmann wrote: > > +static inline int common_clock_adj(const clockid_t which_clock, int ppb, > > + struct timespec *tp) > > +{ > > + return -EOPNOTSUPP; > > +} > > + > > static int no_timer_create(struct k_itimer *new_timer) > > { > > return -EOPNOTSUPP; > > So we already return -EOPNOTSUPP in some cases? The man page does not document this. > I wonder if we should change that to -EINVAL as well. ENOTTY is the usual errno for "inappropriate ioctl for device". Due to the way this patch has been chopped up, I can't tell if that's what is intended here.
On Monday 23 August 2010, Matthew Wilcox wrote: > On Mon, Aug 23, 2010 at 02:57:26PM +0200, Arnd Bergmann wrote: > > > +static inline int common_clock_adj(const clockid_t which_clock, int ppb, > > > + struct timespec *tp) > > > +{ > > > + return -EOPNOTSUPP; > > > +} > > > + > > > static int no_timer_create(struct k_itimer *new_timer) > > > { > > > return -EOPNOTSUPP; > > > > So we already return -EOPNOTSUPP in some cases? The man page does not document this. > > I wonder if we should change that to -EINVAL as well. > > ENOTTY is the usual errno for "inappropriate ioctl for device". Due to > the way this patch has been chopped up, I can't tell if that's what is > intended here. It's for the CLOCK_* syscall family, which I think is different enough from an ioctl that ENOTTY makes no sense. The documented return values of timer_create() are EAGAIN, EINVAL and ENOMEM. Arnd -- To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
EOPNOTSUPP is also called ENOTSUP in userland. ENOTSUP is the appropriate POSIX errno code for a situation such as a clock type that cannot be used in a certain call (like setting when you can only read it, etc.). Thanks, Roland -- To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On Mon, 2010-08-23 at 14:57 +0200, Arnd Bergmann wrote: > On Monday 23 August 2010, Richard Cochran wrote: > > A new syscall is introduced that allows tuning of a POSIX clock. The > > syscall is implemented for four architectures: arm, blackfin, powerpc, > > and x86. > > > > The new syscall, clock_adjtime, takes two parameters, a frequency > > adjustment in parts per billion, and a pointer to a struct timespec > > containing the clock offset. If the pointer is NULL, a frequency > > adjustment is performed. Otherwise, the clock offset is immediately > > corrected by skipping to the new time value. > > It looks well-implemented, and seems to be a reasonable extension > to the clock API. I'm looking forward to your ptp patches on top > of this to see how it all fits together. > > For new syscalls, it's best to take linux-api on Cc. I also added > John, since he participated in the discussion. As I mentioned in the previous mail, I agree the new functionality (adjusting the time by an offset instantaneously) is useful, but I'd prefer it be done initially within the existing adjtimex() interface. Then if the posix-time clock_id multiplexing version of adjtimex is found to be necessary, the new syscall should be introduced, using the same API (not all clock_ids need to support all the adjtimex modes, but the new interface should be sufficient for NTPd to use). There are some other conceptual issues this new syscall introduces: 1) While clock_adjtimex(CLOCK_REALTIME,...) would be equivalent to adjtimex(), would clock_adjtimex(CLOCK_MONOTONIC,...) make sense? Given CLOCK_MONOTONIC and CLOCK_REALTIME are both based off the same notion of time, but offset from each other, any adjustment to one clock would be reflected in the other. However, the API would make it seem like they could be adjusted independently. 2) The same issue in #1 exists for CLOCK_REALTIME/MONOTONIC_COARSE variants. 3) Freq steering for MONOTONIC_RAW would defeat the purpose of the clock_id. 4) Does adjustments to CPU_TIME clock_ids make sense? I'm guessing "no" is the right call to all of the above, but am interested if others see it differently. thanks -john -- To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On Mon, Aug 23, 2010 at 01:41:13PM -0700, john stultz wrote: > As I mentioned in the previous mail, I agree the new functionality > (adjusting the time by an offset instantaneously) is useful, but I'd > prefer it be done initially within the existing adjtimex() interface. But the adjtimex does not support nanosecond resolution. > Then if the posix-time clock_id multiplexing version of adjtimex is > found to be necessary, the new syscall should be introduced, using the > same API (not all clock_ids need to support all the adjtimex modes, but > the new interface should be sufficient for NTPd to use). Would the new syscall need to take a struct timex? If so, I think it not worth the effort of adding a syscall. Instead, we can just add "clockid" flags into the mode field. > There are some other conceptual issues this new syscall introduces: > > 1) While clock_adjtimex(CLOCK_REALTIME,...) would be equivalent to > adjtimex(), would clock_adjtimex(CLOCK_MONOTONIC,...) make sense? > > Given CLOCK_MONOTONIC and CLOCK_REALTIME are both based off the same > notion of time, but offset from each other, any adjustment to one clock > would be reflected in the other. However, the API would make it seem > like they could be adjusted independently. You could adjust the frequency of either one. As a side effect, the other clock would also be adjusted. You can only change the time offset on CLOCK_REALTIME, and that would have no effect on CLOCK_MONOTONIC. > 2) The same issue in #1 exists for CLOCK_REALTIME/MONOTONIC_COARSE > variants. > > 3) Freq steering for MONOTONIC_RAW would defeat the purpose of the > clock_id. If I understand correctly, MONOTONIC_RAW is just access to the hardware counter? > 4) Does adjustments to CPU_TIME clock_ids make sense? Don't think so. Thanks, Richard -- To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On Fri, 2010-08-27 at 13:24 +0200, Richard Cochran wrote: > On Mon, Aug 23, 2010 at 01:41:13PM -0700, john stultz wrote: > > As I mentioned in the previous mail, I agree the new functionality > > (adjusting the time by an offset instantaneously) is useful, but I'd > > prefer it be done initially within the existing adjtimex() interface. > > But the adjtimex does not support nanosecond resolution. As mentioned in the last mail, that's not the case. > > Then if the posix-time clock_id multiplexing version of adjtimex is > > found to be necessary, the new syscall should be introduced, using the > > same API (not all clock_ids need to support all the adjtimex modes, but > > the new interface should be sufficient for NTPd to use). > > Would the new syscall need to take a struct timex? > > If so, I think it not worth the effort of adding a syscall. Instead, > we can just add "clockid" flags into the mode field. Personally I'd add the new clock_adjtime interface, since it parallels the gettimeofday/clock_gettime() interface levels. Trying to multiplex posix clock ids via the older interface feels a little ugly. > > There are some other conceptual issues this new syscall introduces: > > > > 1) While clock_adjtimex(CLOCK_REALTIME,...) would be equivalent to > > adjtimex(), would clock_adjtimex(CLOCK_MONOTONIC,...) make sense? > > > > Given CLOCK_MONOTONIC and CLOCK_REALTIME are both based off the same > > notion of time, but offset from each other, any adjustment to one clock > > would be reflected in the other. However, the API would make it seem > > like they could be adjusted independently. > > You could adjust the frequency of either one. As a side effect, the > other clock would also be adjusted. This in most ways makes the most sense to me, since if CLOCK_REALTIME is properly freq corrected, it would seem CLOCK_MONOTONIC would as well. > You can only change the time offset on CLOCK_REALTIME, and that would > have no effect on CLOCK_MONOTONIC. But yes, this is another possibly valid interpretation. I don't prefer this one, but that doesn't make it invalid. And so with the new interface, and the possibility of multiple non-synced clocks, there are more unfortunate subtleties like this. > > 3) Freq steering for MONOTONIC_RAW would defeat the purpose of the > > clock_id. > > If I understand correctly, MONOTONIC_RAW is just access to the > hardware counter? Not exactly. Its abstracted out a step. MONOTONIC_RAW was added as there were other applications that were trying to get to raw hardware counters through various means. Unfortunately that caused portability issues. So CLOCK_MONOTONIC_RAW allows a constant freq nanosecond representation of a hardware counter. This is similar to what I'm hoping to find here with CLOCK_PTP. Is there a step out that makes this interface similarly abstracted out and easier to understand from a userland perspective? (This is the similar to Alan's critique that it needs to not be PTP specific). Additionally, I'm trying to make sure that having multiple unsynced clocks accessible from the same top-level interface isn't going to become a headache down the road API wise. If we abstract CLOCK_PTP out, doesn't it in effect just be CLOCK_REALTIME_BUTDIFFERENT? thanks -john -- To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
diff --git a/arch/arm/include/asm/unistd.h b/arch/arm/include/asm/unistd.h index dd2bf53..6bea0b7 100644 --- a/arch/arm/include/asm/unistd.h +++ b/arch/arm/include/asm/unistd.h @@ -392,6 +392,7 @@ #define __NR_rt_tgsigqueueinfo (__NR_SYSCALL_BASE+363) #define __NR_perf_event_open (__NR_SYSCALL_BASE+364) #define __NR_recvmmsg (__NR_SYSCALL_BASE+365) +#define __NR_clock_adjtime (__NR_SYSCALL_BASE+366) /* * The following SWIs are ARM private. diff --git a/arch/arm/kernel/calls.S b/arch/arm/kernel/calls.S index 37ae301..8a22fdd 100644 --- a/arch/arm/kernel/calls.S +++ b/arch/arm/kernel/calls.S @@ -375,6 +375,7 @@ CALL(sys_rt_tgsigqueueinfo) CALL(sys_perf_event_open) /* 365 */ CALL(sys_recvmmsg) + CALL(sys_clock_adjtime) #ifndef syscalls_counted .equ syscalls_padding, ((NR_syscalls + 3) & ~3) - NR_syscalls #define syscalls_counted diff --git a/arch/blackfin/include/asm/unistd.h b/arch/blackfin/include/asm/unistd.h index 22886cb..6671913 100644 --- a/arch/blackfin/include/asm/unistd.h +++ b/arch/blackfin/include/asm/unistd.h @@ -389,8 +389,9 @@ #define __NR_rt_tgsigqueueinfo 368 #define __NR_perf_event_open 369 #define __NR_recvmmsg 370 +#define __NR_clock_adjtime 371 -#define __NR_syscall 371 +#define __NR_syscall 372 #define NR_syscalls __NR_syscall /* Old optional stuff no one actually uses */ diff --git a/arch/blackfin/mach-common/entry.S b/arch/blackfin/mach-common/entry.S index a5847f5..252f2fa 100644 --- a/arch/blackfin/mach-common/entry.S +++ b/arch/blackfin/mach-common/entry.S @@ -1628,6 +1628,7 @@ ENTRY(_sys_call_table) .long _sys_rt_tgsigqueueinfo .long _sys_perf_event_open .long _sys_recvmmsg /* 370 */ + .long _sys_clock_adjtime .rept NR_syscalls-(.-_sys_call_table)/4 .long _sys_ni_syscall diff --git a/arch/powerpc/include/asm/systbl.h b/arch/powerpc/include/asm/systbl.h index a5ee345..e7dce86 100644 --- a/arch/powerpc/include/asm/systbl.h +++ b/arch/powerpc/include/asm/systbl.h @@ -326,3 +326,4 @@ SYSCALL_SPU(perf_event_open) COMPAT_SYS_SPU(preadv) COMPAT_SYS_SPU(pwritev) COMPAT_SYS(rt_tgsigqueueinfo) +COMPAT_SYS_SPU(clock_adjtime) diff --git a/arch/powerpc/include/asm/unistd.h b/arch/powerpc/include/asm/unistd.h index f0a1026..7d4d9c8 100644 --- a/arch/powerpc/include/asm/unistd.h +++ b/arch/powerpc/include/asm/unistd.h @@ -345,10 +345,11 @@ #define __NR_preadv 320 #define __NR_pwritev 321 #define __NR_rt_tgsigqueueinfo 322 +#define __NR_clock_adjtime 323 #ifdef __KERNEL__ -#define __NR_syscalls 323 +#define __NR_syscalls 324 #define __NR__exit __NR_exit #define NR_syscalls __NR_syscalls diff --git a/arch/x86/ia32/ia32entry.S b/arch/x86/ia32/ia32entry.S index e790bc1..8237c8d 100644 --- a/arch/x86/ia32/ia32entry.S +++ b/arch/x86/ia32/ia32entry.S @@ -842,4 +842,5 @@ ia32_sys_call_table: .quad compat_sys_rt_tgsigqueueinfo /* 335 */ .quad sys_perf_event_open .quad compat_sys_recvmmsg + .quad compat_sys_clock_adjtime ia32_syscall_end: diff --git a/arch/x86/include/asm/unistd_32.h b/arch/x86/include/asm/unistd_32.h index beb9b5f..79cbef6 100644 --- a/arch/x86/include/asm/unistd_32.h +++ b/arch/x86/include/asm/unistd_32.h @@ -343,10 +343,11 @@ #define __NR_rt_tgsigqueueinfo 335 #define __NR_perf_event_open 336 #define __NR_recvmmsg 337 +#define __NR_clock_adjtime 338 #ifdef __KERNEL__ -#define NR_syscalls 338 +#define NR_syscalls 339 #define __ARCH_WANT_IPC_PARSE_VERSION #define __ARCH_WANT_OLD_READDIR diff --git a/arch/x86/include/asm/unistd_64.h b/arch/x86/include/asm/unistd_64.h index ff4307b..3ee70cd 100644 --- a/arch/x86/include/asm/unistd_64.h +++ b/arch/x86/include/asm/unistd_64.h @@ -663,6 +663,8 @@ __SYSCALL(__NR_rt_tgsigqueueinfo, sys_rt_tgsigqueueinfo) __SYSCALL(__NR_perf_event_open, sys_perf_event_open) #define __NR_recvmmsg 299 __SYSCALL(__NR_recvmmsg, sys_recvmmsg) +#define __NR_clock_adjtime 300 +__SYSCALL(__NR_clock_adjtime, sys_clock_adjtime) #ifndef __NO_STUBS #define __ARCH_WANT_OLD_READDIR diff --git a/arch/x86/kernel/syscall_table_32.S b/arch/x86/kernel/syscall_table_32.S index 8b37293..3569859 100644 --- a/arch/x86/kernel/syscall_table_32.S +++ b/arch/x86/kernel/syscall_table_32.S @@ -337,3 +337,4 @@ ENTRY(sys_call_table) .long sys_rt_tgsigqueueinfo /* 335 */ .long sys_perf_event_open .long sys_recvmmsg + .long sys_clock_adjtime diff --git a/include/linux/posix-timers.h b/include/linux/posix-timers.h index 4f71bf4..534c12d 100644 --- a/include/linux/posix-timers.h +++ b/include/linux/posix-timers.h @@ -71,6 +71,8 @@ struct k_clock { int (*clock_getres) (const clockid_t which_clock, struct timespec *tp); int (*clock_set) (const clockid_t which_clock, struct timespec * tp); int (*clock_get) (const clockid_t which_clock, struct timespec * tp); + int (*clock_adj) (const clockid_t which_clock, int ppb, + struct timespec *tp); int (*timer_create) (struct k_itimer *timer); int (*nsleep) (const clockid_t which_clock, int flags, struct timespec *, struct timespec __user *); @@ -85,6 +87,7 @@ struct k_clock { }; void register_posix_clock(const clockid_t clock_id, struct k_clock *new_clock); +void unregister_posix_clock(const clockid_t clock_id); /* error handlers for timer_create, nanosleep and settime */ int do_posix_clock_nonanosleep(const clockid_t, int flags, struct timespec *, @@ -97,6 +100,8 @@ int posix_timer_event(struct k_itimer *timr, int si_private); int posix_cpu_clock_getres(const clockid_t which_clock, struct timespec *ts); int posix_cpu_clock_get(const clockid_t which_clock, struct timespec *ts); int posix_cpu_clock_set(const clockid_t which_clock, const struct timespec *ts); +int posix_cpu_clock_adj(const clockid_t which_clock, int ppb, + struct timespec *tp); int posix_cpu_timer_create(struct k_itimer *timer); int posix_cpu_nsleep(const clockid_t which_clock, int flags, struct timespec *rqtp, struct timespec __user *rmtp); diff --git a/include/linux/syscalls.h b/include/linux/syscalls.h index 13ebb54..f641cc5 100644 --- a/include/linux/syscalls.h +++ b/include/linux/syscalls.h @@ -314,6 +314,9 @@ asmlinkage long sys_clock_settime(clockid_t which_clock, const struct timespec __user *tp); asmlinkage long sys_clock_gettime(clockid_t which_clock, struct timespec __user *tp); +asmlinkage long sys_clock_adjtime(clockid_t which_clock, + int ppb, + const struct timespec __user *tp); asmlinkage long sys_clock_getres(clockid_t which_clock, struct timespec __user *tp); asmlinkage long sys_clock_nanosleep(clockid_t which_clock, int flags, diff --git a/kernel/compat.c b/kernel/compat.c index 5adab05..df1e469 100644 --- a/kernel/compat.c +++ b/kernel/compat.c @@ -628,6 +628,26 @@ long compat_sys_clock_gettime(clockid_t which_clock, return err; } +long compat_sys_clock_adjtime(clockid_t which_clock, int ppb, + struct compat_timespec __user *tp) +{ + long err; + mm_segment_t oldfs; + struct timespec ts, *ptr = NULL; + + if (tp) { + if (get_compat_timespec(&ts, tp)) + return -EFAULT; + ptr = &ts; + } + oldfs = get_fs(); + set_fs(KERNEL_DS); + err = sys_clock_adjtime(which_clock, ppb, + (struct timespec __user *) ptr); + set_fs(oldfs); + return err; +} + long compat_sys_clock_getres(clockid_t which_clock, struct compat_timespec __user *tp) { diff --git a/kernel/posix-cpu-timers.c b/kernel/posix-cpu-timers.c index 9829646..5843f5a 100644 --- a/kernel/posix-cpu-timers.c +++ b/kernel/posix-cpu-timers.c @@ -207,6 +207,11 @@ int posix_cpu_clock_set(const clockid_t which_clock, const struct timespec *tp) return error; } +int posix_cpu_clock_adj(const clockid_t which_clock, int ppb, + struct timespec *tp) +{ + return -EOPNOTSUPP; +} /* * Sample a per-thread clock for the given task. diff --git a/kernel/posix-timers.c b/kernel/posix-timers.c index ad72342..089b0d1 100644 --- a/kernel/posix-timers.c +++ b/kernel/posix-timers.c @@ -197,6 +197,12 @@ static int common_timer_create(struct k_itimer *new_timer) return 0; } +static inline int common_clock_adj(const clockid_t which_clock, int ppb, + struct timespec *tp) +{ + return -EOPNOTSUPP; +} + static int no_timer_create(struct k_itimer *new_timer) { return -EOPNOTSUPP; @@ -488,6 +494,21 @@ void register_posix_clock(const clockid_t clock_id, struct k_clock *new_clock) } EXPORT_SYMBOL_GPL(register_posix_clock); +void unregister_posix_clock(const clockid_t clock_id) +{ + struct k_clock *clock; + + if ((unsigned) clock_id >= MAX_CLOCKS) { + pr_err("POSIX clock unregister failed for clock_id %d\n", + clock_id); + return; + } + + clock = &posix_clocks[clock_id]; + memset(clock, 0, sizeof(*clock)); +} +EXPORT_SYMBOL_GPL(unregister_posix_clock); + static struct k_itimer * alloc_posix_timer(void) { struct k_itimer *tmr; @@ -968,6 +989,23 @@ SYSCALL_DEFINE2(clock_gettime, const clockid_t, which_clock, } +SYSCALL_DEFINE3(clock_adjtime, const clockid_t, which_clock, + int, ppb, const struct timespec __user *, tp) +{ + struct timespec new_tp, *ts = NULL; + + if (invalid_clockid(which_clock)) + return -EINVAL; + + if (tp) { + if (copy_from_user(&new_tp, tp, sizeof(*tp))) + return -EFAULT; + ts = &new_tp; + } + + return CLOCK_DISPATCH(which_clock, clock_adj, (which_clock, ppb, ts)); +} + SYSCALL_DEFINE2(clock_getres, const clockid_t, which_clock, struct timespec __user *, tp) {
A new syscall is introduced that allows tuning of a POSIX clock. The syscall is implemented for four architectures: arm, blackfin, powerpc, and x86. The new syscall, clock_adjtime, takes two parameters, a frequency adjustment in parts per billion, and a pointer to a struct timespec containing the clock offset. If the pointer is NULL, a frequency adjustment is performed. Otherwise, the clock offset is immediately corrected by skipping to the new time value. In addtion, the patch provides way to unregister a posix clock. This function is need to support posix clocks implemented as modules. Signed-off-by: Richard Cochran <richard.cochran@omicron.at> --- arch/arm/include/asm/unistd.h | 1 + arch/arm/kernel/calls.S | 1 + arch/blackfin/include/asm/unistd.h | 3 +- arch/blackfin/mach-common/entry.S | 1 + arch/powerpc/include/asm/systbl.h | 1 + arch/powerpc/include/asm/unistd.h | 3 +- arch/x86/ia32/ia32entry.S | 1 + arch/x86/include/asm/unistd_32.h | 3 +- arch/x86/include/asm/unistd_64.h | 2 + arch/x86/kernel/syscall_table_32.S | 1 + include/linux/posix-timers.h | 5 ++++ include/linux/syscalls.h | 3 ++ kernel/compat.c | 20 ++++++++++++++++++ kernel/posix-cpu-timers.c | 5 ++++ kernel/posix-timers.c | 38 ++++++++++++++++++++++++++++++++++++ 15 files changed, 85 insertions(+), 3 deletions(-)